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Abstract 



We prove consistency of four different approaches to formalizing the 
idea of minimum average edge-length in a path linking some infinite 
subset of points of a Poisson process. The approaches are (i) shortest 
path from origin through some m distinct points; (ii) shortest average 
edge-length in paths across the diagonal of a large cube; (iii) shortest 
path through some specified proportion S of points in a large cube; 
(iv) translation-invariant measures on paths in M. d which contain a 
proportion S of the Poisson points. We develop basic properties of 
a normalized average length function c(<5) and pose challenging open 
problems. 
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1 Introduction 



Fix dimension d > 2 and let be a Poisson point process of rate 1 per 
unit volume in M. d . "Volume" means d-dimensional Lebesgue measure. 

To start with an analogy, one can informally describe the critical value 
for continuum percolation as the smallest c such that there exists some 
infinite sequence £ji,£j 2 '£j3' • • • of distinct points such that maxj>i — 
£jj < 2c. Here | • | denotes Euclidean distance. What we study in this 
paper can analogously be described informally as the smallest c such that 
there exists some path through an infinite sequence ^ , £ 3 - 2 , £ 3 - 3 , . . . of distinct 
points whose average edge-length linin^oo n _1 Y^i=i ~~ £iJ — c - One 
could formalize this directly by e.g. replacing Zim by Urn sup, requiring the 
property to hold almost surely and then taking the in/ of such c. But such a 
definition seems neither elegant nor convenient. Our purpose in this paper 
is to study four indirect approaches to this formalization question and show 
that they lead to the same constant, which we call c(0+). Along the way 
we introduce a function c(5) which plays a role analogous to the percolation 
function. Our results are collected in Theorem ^ 

Conceptually, this topic seems intermediate between first passage per- 
colation and the random traveling salesman problem (TSP). Regarding the 
former, our c(0+) differs (informally speaking) from a continuum analog of 
the time constant in first passage percolation jSJ |H] because we use "dis- 
tance along a path" in place of "end-to-end Euclidean distance" (nonethe- 
less we use "continuum first passage percolation" as a descriptor of one of 
our approaches below). Regarding the latter, take n random points in a 
(i-dimensional cube of volume n. Let L n (l) be the length of the shortest 
cycle through all n points, i.e. the length of the solution of the TSP. Al- 
most 50 years ago, Beardwood-Halton-Hammersley proved there exists 
a constant c(l) such that EL n (l) ~ c(l)n as n — > oo. Subsequent work on 
related problems is described in the monographs by Steele and Yukich 
|14j . Another of our approaches modifies the TSP by considering cycles 
through some specified proportion 5 of the Poisson points. Additional mo- 
tivation for the current paper comes from work on such problems in the 
"mean-field" setting, described in section l2~2l 

Challenging problems for future research are listed in section |2~T1 One of 
our techniques - subadditive analysis of optimal cost / reward ratios - seems 
potentially applicable in other contexts, as briefly discussed in section 14.11 
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2 The equivalence theorem 



Approach 1 Continuum first passage percolation from the origin. For 
each m > 1 define a random variable T m as the minimum, over choices 
{£n T--, £j m } °f m distinct points of the Poisson process on K d , of 

m 

l^il+El^i-^W-il- 

Comment. One certainly expects that T m /m should converge to a constant, 
but we see no easy argument. In particular we don't see how to apply 
subadditivity arguments directly to study T m . 

Approach 2 Continuum first passage percolation across a diagonal. For 
s > define a random variable W s as the minimum, over all m > 1 and 
all choices . . . , £j m _ 1 } C [0, s] d of m — 1 distinct points of the Poisson 
process, of 

m 
i=l 

where £ io = (0, . . . , 0) and £ jm = (s, . . . , s). 

Comment. Here we can attempt subadditivity analysis, based on splitting 
the cube of side 2s into 2 d subcubes of side s, though because of the "ra- 
tio" form of Q we are not in the usual format for the subadditive ergodic 
theorem. 

Approach 3 TSP on sparse subsets of the cube. Let C n = [0, n 1 / d ] d be the 
cube of volume n in W 1 . Put n random (independent, uniformly distributed) 
points (Cj) into C n . Fix < S < 1. Let L n (8) be the minimum, over all 
choices of cycles (Cji i Cj2 > • • • > Cjn» j Om+i = Cn) through m = \5n\ disjoint 
choices from the random points, of the cycle length Y^hLx IGj+i — Cj'il- 
Comment. Here subadditivity can be applied in familiar ways. Note that, 
in contrast to continuum percolation where definitions are in terms of the 
process on infinite space M. d , the approaches above all envisage taking limits 
over finite regions. For the record we give a final approach which does work 
directly on infinite space, as "the n = oo analog of Approach 3" , though we 
admit it does not seem very useful. 

Approach 4 Translation invariant distributions on infinite paths through 
Poisson points. Consider a locally finite set (x{) of points in M. d , together 
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with a set £ of edges whose endpoints are in (xj), where the edges form 
a collection of doubly-infinite paths, each point of (xi) appearing either 
once or never in the paths. Write S for the space of such points-and-paths 
configurations. The Euclidean translation group acts naturally on S, so one 
can define a probability distribution fi on S to be invariant if it is invariant 
under the action of the Euclidean translation group. Let VW be the set of 
invariant distributions on S under which the distribution of the points 
is the Poisson point process of rate 1. Informally, a \i E M. is just a way of 
collecting some subsets of the Poisson points into paths using a rule which 
doesn't depend on the location of the origin. For /i 6 A4 there is a constant 
5(fi) G [0, 1] specified informally as "the proportion of points which are in 
some infinite path" and defined formally via the formula: for every cube 
C c R d , 

(number of points ^gC which are in some infinite path) = 5(/j,) volume(C). 

Similarly there is a constant £(fi) interpreted as "mean edge-length over 
all edges in the paths of //" and formally via the formula: for every cube 
C C M. d , 

^(length of £ n C) = 5{^)i{n) volume(C) 
(here we regard £ as a subset of M. d ) . Finally define 

c(5) := inf{%) : fj, e M, 5{p) = 5}. 

Theorem 1 

(a) For < 5 < 1 there exists a constant c(5) such that Ln J^ — » c(5) in 
I? as n — > oo. 

(b) The function c(5) is non- decreasing and continuous on (0, 1], the func- 
tion 5c(5) is convex, and the limit c(0+) := lim^o c(5) is strictly pos- 
itive. 

(c) W s — > c(0+) a.s. as s — > oo. 

(d) c(5) = c(S), < S< 1. 

(e) m~ l T m — > c(0+) in probability. 

From (a) we see c(l) is the constant in the Beardwood-Halton-Hammersley 
[2] theorem. Monte Carlo simulations 7] give (for d = 2) c(l) ~ 0.7119 but 
no close rigorous bounds are known. 
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2.1 Discussion 



The function c(5), < 5 < 1 seems worthy of study as a analog of the 
classical percolation function from lattice percolation theory jSJ: 

f(p) := P (origin is in some infinite component) 

in bond percolation with edge-probability p. In particular one can ask 
whether there exists a scaling exponent < a < oo, that is whether 

c(5) - c(0+) x 5 a as 5 j 0. (2) 

For the record we state the (probably very hard) 

Open Problem 1 Prove holds for some a. Or give bounds on the 
possible values of a. 

The next question may be easier. The facts that c(5) is nondecreasing and 
that 5c(5) is convex imply: either 

(i) c(5) is strictly increasing on < 5 < 1; or 

(ii) c(5) is constant on < 5 < 5o, for some < <5o < 1. 
But the latter seems implausible. 

Open Problem 2 Prove that c(8) is strictly increasing on < S < 1. 

Getting reasonable bounds on the numerical value of c(0+) seems difficult. 
Standard methods (comparison with branching random walk: section 13. 3|) 
give an explicit lower bound (jHJ), which in d = 2 is l/(27re) ~ 0.0585. But 
we don't see any simple way to get an interesting upper bound. Even Monte 
Carlo methods seem difficult to code convincingly; for the record we write 

Open Problem 3 In d = 2 study the numerical value c(0+) and the pre- 
sumed scaling exponent a via Monte Carlo methods. 

The variation in which (in d = 2) we restrict to "upward oriented" paths, 
that is edges (xi,yi) — > (xj + i,yj + i) are required to have yi + \ > yi, is easier 
to study via simulation; our small-scale simulations suggest the analog of 
c(0+) in this variation is ~ 0.62, which would be an a priori upper bound 
for our original c(0+). 

Another question concerns variances. Take d = 2 here. For the basic 
traveling salesman problem, that is for L n (l), it is known that var(L n (l)) is 
order n; precisely, 

liminf n _1 var(L n (l)) > 0, limsupn _1 var(L n (l)) < oo. 

n n 
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The upper bound is explicit in Steele ^2] and the lower bound follows from a 
corresponding large deviation lower bound in Rhee On the other hand, 
for first-passage percolation it has long been conjectured 4. that variance 
grows as the | power of expectation, though little has been proved rigorously 

Open Problem 4 Prove var(T m ) X m 2 / 3 . Or just prove var(T m ) = o(m). 

We will not even venture a conjecture for the asymptotic behavior of var(L n (5)). 

As a small rewriting of the definition of L n (5), let L(n,m) be the mini- 
mum, over all choices of cycles (Q 1 , . . . , Qj m , Q m +i = Cn ) through some cho- 
sen m of the random points, of the cycle length YaLi lCji+i ~~ CjJ- Theorem 
^implies that, if m n /n — > sufficiently slowly, then L(n,m n )/m n — > c(0+). 
However it seems plausible this also holds for smaller values of m n . 

Open Problem 5 Prove that L(n,m n )/m n — > c(0+) in probability when- 
ever m n /n — > and lo ^" n — > oo Va < oo. 

In other words, for fixed c < c(0+) consider Poisson points in a cube of 
volume re; is it true that any cycle with average edge-length < c can have at 
most poly-log(n) edges? This would be an analog of the fact that subcritical 
percolation cluster size distribution has a geometrically-decreasing tail 5 . 

2.2 The mean-field model 

Instead of the Euclidean model in this paper, one can consider a "mean- 
field" model on n points for which the (g) inter-point links are assumed 
to have independent random lengths with Exponential (mean re) distribu- 
tion. Within this model one can define the function c(5) analogous to c(S). 
Mezard-Parisi ^Uj used the non-rigorous replica method of statistical physics 
to argue n~ 1 EL n (l) — ► c(l), where c(l) ~ 2.04 is derived from numerical 
solution of a certain fixed point equation. Using probabilistic reformulations 
of these statistical physics ideas, Aldous P gave a (still non-rigorous) anal- 
ysis of the whole function c(5), exhibited in Figure 1 of [Q, which suggests 
c(5) — 5(0+) x <5 1//3 as 5 I 0. So one might conjecture that the scaling 
exponent | also holds in the Euclidean case. 

Note that the subadditivity arguments we use in the Euclidean case to 
prove Theorem ^ rest upon the "boundary effects are negligible" property 
of M. d . In the mean-field model, the limit analog of the Poisson process is 
a certain infinite random tree, for which boundary effects are not negligi- 
ble and subadditivity arguments cannot be used. Indeed, Approach 4 was 
developed in the mean-field setting as a substitute for subadditivity. 
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3 Proofs 



We start with Approach 3 and prove part (a) of Theorem [Tl in l3~Tl then we 
prove part (b) of Theorem HI in l3~2*l and give a lower bound on c(0+) in 13.31 
Sections 13.41 13.51 13.61 contain the proofs of the remaining parts (c), (d), (e) 
of Theorem^ Part (e) seems hardest, for reasons explained at the start of 
section 13.61 

3.1 TSP on sparse subsets 

We rely on subadditivity arguments as in ^] . These monographs de- 
velop general results for sub- or superadditive Euclidean functions satisfying 
regularity properties. Unfortunately functionals like L n {S) lack the mono- 
tonicity property ( |13| equation (3.5)) and it is not clear whether the smooth- 
ness property (^3] section 3.3) is both valid and exploitable. We will use 
the inequality in (b) below as a substitute for monotonicity. 

Write Ai,A2,... for constants depending only on dimension d > 2. We 
start with a purely deterministic lemma (note that by scaling the case of 
general s is equivalent to the case s = 1). 

Lemma 2 Let {x\, . . . , x n } be arbitrary points in the cube [0, s] d and let 
L{m) be the length of the shortest cycle through some m < n of these points. 

(a) [Uniform boundedness] 

L{n) < A l sn( d ~ 1 V d . (3) 

(b) For 1 < mi < m<i < n, 

L(m\) ^ L(m2) sd 1 / 2 

m l ~ m 2 mi 

(c) [Geometric subadditivity] Let k > 2 and let (0 , 1 < j < k d ) be 

the natural partition of [0, s] d into k subcubes of side s/k. Suppose 
that, for each j, there exists a cycle of length lj through some subset 
Sj C {xi} n C J . Then there exists a cycle through UjSj of length at 
most J2j h + A2sk d ~ 1 . 

Proof. Parts (a) and (c) are standard (^S] sections 2.2 and 2.3). For (b), 
let J/i,2/2, • • • ) ym 2 ;2/m2+i = 2/i be a minimum-length cycle attaining Lim^)- 
Then there exists k such that (interpreting k + i modulo 771-2) 

1 f 1 1 . Limy) 
— y, \Vk+i+i ~ Vk+i\ < 

nil — 77^2 
1=1 
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because the right side equals the average of the left side as k varies. To 
make a cycle on {y k , . . . , y k + mi -i} replace edge (yk+rm-i, Vk+mi) h Y ed ge 
(yfc+mi — i, yk), whose length is at most the diameter sd 1 / 2 of the cube [0, s] d . 

Remark. Lemma |2f a) implies that the worst-case cycle length is the 
same order of magnitude as the average-case lengths we will be studying. 
This has the pleasant consequence that events of probability tending to zero 
will make asymptotically negligible contributions to expectation of length, 
and so can be ignored: we use this uniform boundedness property several 
times later. 

We start analysis of the probability model by making definitions to which 
subadditivity arguments can easily be applied. Recall (&) denotes a Poisson 
point process of rate 1 per unit volume in M. d . Let N(s) be the number of 
points of in [0, s] d . Define L(s,6) as the length of the minimum-length 
cycle through at least some |~5iV(s)] of the points of (&) n [0,s] rf . By the 
triangle inequality, replacing "at least |~5iV(s)~|" by "exactly [5iV(s)] points" 
changes nothing. We will start by using the next lemma as a definition of 
c(<5), and later show this agrees with the limit in Theorem ^a). 

Lemma 3 For fixed < 5 < 1 there exists a constant c(S) such that 

— — -, ► do) in L as s — > oo. 

os a 

Proof. First note s — > EL(s,5) is continuous, from the representation 

00 g nd 



EL(s,6)=Y / ^ T e- s sa n (8), 



n=0 

where a n (5) is the expected length of the shortest cycle through some [Sn] 
of n uniform random points in the unit cube [0, l] d . Next, given s > 0, we 
can write any x>0asx = ks + t with integer k and t £ [0, s). Geometric 
subadditivity (Lemma Hfc)) then implies 

EL(x, 5) < k d EL(s + t/k, 5) + A 2 (s + t/k)^- 1 . 

Taking x — > oo while keeping s fixed 

lim sup x ~ d EL(x,5) < inf sup s~ d EL(s + r, S). 

x^oo e>0 T e[0,e] 

So by the continuity property 

lim s~ d EL(s, 5) = inf s' d EL(s, 5) = Sc(S), say (4) 

s^oo s>0 
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with < c(5) < oo. 

Once again by geometric subadditivity, temporarily abbreviating L(s, 5) 
to L s , 

EL\ S < k d EL 2 s + k d {k d - 1)(EL S ) 2 + A 2 sk 2d - 1 EL S + (A 2 sk d - 1 ) 2 . 
By the uniform boundedness property © 

EL 2 < Ajs 2 EV^<f 1 < A\s 2 ■ A 3 s 2id - 1) = A\A^, 

where V^d^ has Poisson(s d ) distribution and where the second inequality 
holds for some A3 for all s > 1. Combining the two displayed inequalities 
and (@J gives 

lim sup x~ 2d EL 2 x = (6c(5)f, 

X 

and finally 

s- d L(s,5) -» 5c(5) in L 2 . 

Remark. More sophisticated modern proofs ( JH] sec. 2.4; ^3] sec. 4.1) 
of the TSP case (6 = 1) use concentration inequalities to obtain almost 
sure convergence; we have not investigated concentration inequalities or a.s. 
convergence for L(s,6). 

Now we proceed with the proof of Theorem ^a) with c(<5) defined by 
Lemma 01 

Proof of Theorem ffl(a). Recall the definition of L n (S). Let C n = 
[0, n l l d ] d be the cube of volume n in M. d . Put n random (independent, 
uniformly distributed) points (Q) into C n . Fix < S < 1. Let L n (S) 
be the minimum, over all choices of cycles (CjiXj2^ ■ ■ ■ Xj m Xj m +i = Cn) 
through any chosen m = \5n\ of the random points, of the cycle length 
|Cii+i — Oil- Again by the triangle inequality, this is the same as saying 
"any m > \6n \ of the random points" . 

Fix small e > and consider, in C n , a Poisson process of rate 1 — e per 
unit volume. By standard properties of the Poisson process, for each n we 
can couple this to the process of n i.i.d. uniform points in C n in such a way 
that, with probability — > 1 as n — > 00, each point of the Poisson process 
is a point of the uniform process. Call this the inclusion coupling. There 
is a similar inclusion coupling between the uniform process on C n and the 
Poisson process of rate 1 + e. 

Now write L\(s,5) to mean the quantity L(s,5) applied to a Poisson 
process of rate A; and write N\(s) for the number of points of that Poisson 
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process in [0, s] d . When the inclusion couplings hold and when 
N l+£ {n 1 / d ) <5n< -^N^ £ (n l,d ) 



l + 2e ^ v ' ~ ~ l-2e 
then we have 

Li + e(n 1/d , j^- £ ) < L n (5) < L^ £ {n l '\ ^). (5) 

This holds because, in informal language, for each < we have more points 
to choose from, and a weaker constraint on minimum number of points in 
the cycle. Now by scaling 

rSs) = (1 - e)~ 1/d W - eyWnW 

and similarly 

Li + e(n 1/d , ifc) = (1 + e)- 1/d L((l + e)- 1 /^, ^). 

Taking limits in (fHjl. using Lemma|3]and the uniform boundedness property, 
we get 

lim sup n^ELniS) < (1 - e)- 1 " 1 /"^^^) 

n 

and similarly 

(1 + e)- 1 - 1 /^^^) < liminf n- 1 ^^). 

Letting e J. and using continuity of c{8) (which we prove independently in 
the next subsection) we see that for < S < 1 

L n{5) , . . ! 

— - > do) m L . 

on 

The case 8 = 1 is similar, but of course is already part of the usual proof 
|13j of the Beardwood-Halton-Hammersley theorem, so we omit it. 

3.2 Properties of c{8) 

Next we prove part (b) of Theorem ^ 

Proposition 4 c{8) is non- decreasing on < 5 < 1. 
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Proof. Fix < Si < S 2 < 1. By Lemma Hfb) 

L(Mi) < L^fe) | sd 1 / 2 



r*ijv( 8 )i - r^ 2 iv( S )i r<5iiv( s )i 

and so 



f<5iJV(a)l/<5i 



5is d 5 2 s d 5is 

where 



\5 2 N(s)]/6 2 

Since i? s is uniformly bounded and R s — > 1 as s — > oo, using Lemma El we 
deduce c(<5i) < c(<5 2 ). 

Proposition 5 £c(<5) is convex on < <5 < 1. 

Proof. Fix < ft < <5 2 < 1 and < A < 1. Let (C j , l<j< k d ) be the 
partition of [0, s] d into k d equal subcubes. Let Nj be the number of points 
of in the j-th subcube and let £j(s) (resp. £j(s)) be the length of the 
shortest cycle through some [JiiVj] (resp. [5 2 iVj-]) points in C 5 . 
Take any 5 < \5\ + (1 — A)<5 2 . The event 

\Xk d ] k d 

j=l j=[Afc d ]+l 

has probability — > 1 as s — > oo, and on this event we have by Lemma I2fc) 

\Xk d ] k d 

L(s,S) < ^ £j(s)+ Yl £j(s) + A 2 sk d - 1 . 

j=l j=\Xk d ]+l 

Taking expectations, letting s — > oo and using Lemma |3] we obtain 

oc(6) < —j^- A ic(5i) H p <5 2 c(d 2 ), 

and letting fc — > oo 

tfc(<J) < A5 lC ((5i) + (1 - A)<5 2 c(5 2 ). (6) 

If (5 = A'fli + (1 - A')<5 2 then © holds for all A < A' and hence for A = A', 
proving convexity. 
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Proposition 6 c(5) is continuous on (0,1]. 

Proof. Convexity of 5c(5) implies continuity of 5c(5), and hence continuity 
of c(8), on the open interval < 5 < 1. Continuity at S = 1 requires a 
separate argument. 

Take 5 < 1. The cycle attaining L n {5) passes through \nS\ points in 
[0, n 1 / d ] d . By Lemmata) there exists a cycle through the remaining q = 
n — \nS\ points with length at most 

A 1 n 1 ' d q( d ~ 1 V i <A 1 n(l-8)V- 1 V d . 

By joining the two cycles we find 

L n (l) < L n (5) + Am(l - 8)W d + 2n 1 / d d 1 / 2 . 

Letting n — > oo 

c(l) < <yc(<5) +Ai(l 
and this implies continuity as 8 f 1. 

3.3 A lower bound on c(0+) 

We start by noting a one-sided bound relating the in Approach 1 to 
c(0+). In the context of Theorem [^a), the cycle attaining length L n (8) can 
be converted into a path from the origin by replacing some edge (£, £') by 
the edge from the origin to C'- It follows that 

limP(A;- 1 r fc < c(8) +e) = 1 Ve > 

and thus 

lim Pik^Tk < c(0+) + e) = 1 Ve > 0. (7) 

fe 

We can now use a standard argument. Consider branching random walk on 
R + , starting with one individual at the origin in generation 0, and where each 
individual in each generation (at position x, say) has children at positions 
[x + j > 1) where (^) forms a Poisson point process of rate 1 in 
W d . Write #&(■) for the mean measure for the positions of the generation-fe 
individuals > 1): 

w=J2 p o r k, i e •)• 
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This is exactly the same measure as the mean measure for lengths of k-step 
paths from the origin through the Poisson points: 

(h,..,jk) V i=2 ) 

where the sum is over ordered distinct A:-tuples. So for T k as defined in 
Approach 1, and for c > 0, A > 

P{T k <ck) < 9 k [0,ck] (Markov's inequality) 



< e Xck 



k 



poo 

/ e~ Xx 9k(dx) (large deviation inequality) 
Jo 

/•oo 
JO 

by the structure of branching random walk. Comparing with (J7| we see 
c(0+) > supjc : infe Ac y e" A|i/| dy < lj . 

Writing Vd for the volume of the unit ball in M. d , 

-X\y\ dv - VdTjd+l) 
a V ~ X d-i ■ 

The inf\>o( • ) is now attained at A = (d — l)/c and we finally find 

c(0+) ^e-V-^MXd+l))" 3 ^ (8) 
and of course Vd = vr rf//2 /r(l + d/2). In particular, for d = 2 we find c(0+) > 

3.4 The limit for W s 

We now turn to Approach 2. Here we consider the rate-1 Poisson process 
on M. d . Let IIo )S be the set of paths ir across the diagonal of C s ; that is, 
of paths 

(0, . . . , 0) = £j , £jj , . . . , Cjm-l > Om = ( S ) • • • ; s ) 

where , . . . , £j m _i } are distinct points of n [0, Write 

1=1 
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for the length and number of edges in the path n and 

w(jr) = l{ir)/m(iv) 

for the average edge-length of tt. 

By considering diagonally- adjacent unit cubes and picking (where pos- 
sible) one point from each, we see there exists a path 7r grccdy € Ilo,s such 
that 

"i(7Tgr C cdy) — 1 has Binomial ( |_sj , 1 — e _1 ) distribution; 

(9) 

^greedy) < A 4 \ S~] 

for some constant A4. Recall the definition 

Wo := mm 



7ren , s m(ir) 
Applying to vr grocdy and using @ we see 

limsupW s < A 4 /(l - e" 1 ) a.s. (10) 

s 

Similarly one can check that s/m(7r grecdy ) is uniformly integrable as s — > 00 
and so 

(Ws,0 < s < 00) is uniformly integrable. (11) 

While subadditivity is not applicable directly to W s , the proof below is an 
easy indirect application. 

Proposition 7 W s — > a.s. and in L 1 as s — > oo ; for some constant 
0<P<oo. 



Proof. For c > define 



xjfl = min (£(tt) — crra(7r) + c). 

' 7rGlIo, s 

We shall prove the Proposition for 

P := sup{c : EX^ S > Vs > 0}. 
For s < t write H s t for the set of paths across the diagonal of [s, t] d and 

(c) (c) 

define X K s l analogously to Xq' s . Given a path ix\ G IIo, s and a path TT2 € 
II Sj t, their concatenation gives a path tt £ Ilot, for which 

e(ir) < £(vri) + £(tt 2 ); m(vr) = m(7Ti) + m(7r 2 ) - 1. (12) 
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In such a concatenation, the last edge of k\ and the first edge of 1T2 ar e 
replaced by a single edge and the inequality for l(ir) arises only from the 
triangle inequality for this replacement. 

(c) 

Consider first a value c such that EXq j < for some s. For this s let 

7r be the random path in IIq )S such that E(£(tt) — cm(n) + c) = EXq s < 0. 
So by the concatenation property (|12j) and the strong law of large numbers 
we can construct random paths in IIq & s such that 

£(7T k ) E£(tt) 
hm sup — -. — r < — — t ~r < c a.s. 

fc^oo "i(7Tfe) - 1 £m(7r) - 1 

and it easily follows that 

limsup W s < c a.s. 

From choice of c and definition of (3 we deduce 

limsup W s < j3 a.s. 

On the other hand consider a value of c such that -E.Xq s — ^ f° r au s > 0. 

The process (-X^) is subadditive by p2|). A routine application of the 
subadditive ergodic theorem shows that there exists a constant 7(c) > 
such that 

lim s~ x X^\ = 7(c) a.s. (13) 
Now W s = £(TT s )/m(TT s ) for some random tt s in U s , and 

X$ < £(tt s ) - cm(vr s ) + c, 



implying 



4i - c 

W s -c>^f^. (14) 



Combining l|13j) with ((11]) gives 



liminf(iy s — c) > 7(c) • liminf — - — - > a.s. 

s s m(ir s ) 



and consequently 



liminf Ws > c a.s. 

s 
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From choice of c and definition of (5 we deduce 

liminf W s > (5 a.s. 

s— >oo 

completing the proof of a.s. convergence in Proposition [7| Finally, note 
j3 < oo by (|1U|1 . and note that L 1 convergence follows from 

We use concatenation constructions based on ()12() several times later, 
and here is a rather technical formulation of the results of such constructions, 
designed to replace the use of the expectations E£(tt s ) and Em(ir s ) by using 
truncation. By "adjacent cubes" we mean disjoint cubes in which we can 
choose diagonals to form a connected path (e.g. as in Figure 1 below). 

Lemma 8 (Concatenation argument) Let ir s be a random path across 
the diagonal of the cube [0, s] d , and let wq G (0,s). Then there exist paths 
7r sn through the Poisson points in n adjacent cubes of side s each, such that 

hmsupu>(7r sn ) < w + , — — r a.s. (15) 

n (sd l / 2 /wo - l)P(w(ir s ) < w ) 

and 

Um m(TT sn ) ^ , sd i/2, o _ i\pr w (Tr a ) < w ) a.s. 

n n 

Proof. Given a random path ir s across the diagonal of the cube [0,s] rf , 
consider the modified path 



7T. 



ir s on the event w(ir s ) < u>o, 
f * otherwise, 



where / s is a shortcut path consisting of a single edge from (0, . . . , 0) to 
(s, . . . , s). By concatenating n independent copies of n s we get a path ir ns 
such that, using (fT2|) . 

E^) woEm^ + sd^k 

^Ks) < " < — 

E ™(^°) " (" " 1) E ™(#) - (n - 1) 

u; (n - 1) + sd 1 / 2 *: 
< ™o + - (16) 

Em(7ri j) )-(n-l) 

2=1 
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where k = #{i : w(tTs) > wo . By definition of tt s either m(7r s ) > sd 1 / 2 /wq, 
or m(Tr s ) = 1, so that 

n 

m(ir sn ) = J^m(7fW) - (n-1) > (n- fe)(sd 1 / 2 >o) + A;- (n — 1) 

> (n-fe)(sd 1 / 2 >o-l)- (17) 

Since fc/n — > P(w(7r s ) > wq) a.s. as n — ► oo, combining l(TB|) with (JT7|) gives 
(|15(1 and the second inequality of the lemma follows from (|17|) . 

Proposition 9 /3 = c(0+). 

Proof. Given a cycle through m points in [0, s] d , one can make a path from 
(0, . . . , 0) to (s, . . . , s) through these m points using extra length at most 
2sd l l 2 . So for any 5 > we have, setting s = n 1 ^ and m = [<5rc] , 

w L n (<5) + 2^/^1/2 



[<5nl + 1 

Letting n — > oo and using Proposition [7| and Theorem ^a), we see < c(5). 
So < c(0+). 

For the converse, consider the cube [0, Ks] d for even ET, partitioned into 

rl (i) 

K equal subcubes. In each subcube choose a diagonal path ir s attaining 
the minimum W s , so that concatenating these paths forms a cycle through 
the large cube (see Figure 1). 

Fix e > 0. By Proposition [3 for s = s(e) large enough the random paths 
7Ts satisfy 

P(w(ttW) >/? + £)<£. 

Then by Lemma |H] applied with wo = /3 + e we can construct cycles ir K d in 
[0, Ks] d such that 

/ R _|_ e _|_ sd 1 / 2 ^ \ 

P (^(^ > /? + e + (1 _ £)N1/2/(/3 + g) _ 1} + e j -+ as K -+ oo. 

(18) 

By taking s sufficiently large this simplifies to 

P(w(Tt K d) > + 2e + 20e) -> as K -> oo. (19) 
Also from Lemma ^] there exists <5 S > such that 

P( !! fe 1 > 5s) - 1 as K - co. 
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Figure 1: A cycle built from subcube diagonals in the 6x6 cube 



On this event we have, for 5 < 5 S , 

L(Ks,5) < w{-K K d)m{'K K d). 

So as K — > oo 

P(L(Ks, 5) < (/? + 2e + 2(3e)s d K d 5 s ) -> 1 
and then by Lemma 

Sc(S) < ((3 + 2e + 2(3e)5 s . 
Letting 5 | 5 S then implies 

c(0+) <(3 + 2e + 2/3e, 
so that letting e [ gives the desired inequality 

c(0+) < /3. 

Propositions [7| and |H1 establish part (c) of Theorem ^ For later use we 
record a small variation. 

Lemma 10 (linear diagonal percolation) For r] > let 



mm 



7rGlIo, s :m(7r)<?7S m(7Tj 
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be the minimum average edge-length over paths across the diagonal of [0, s] d 
through at most rjs points. Then there exists a function (3{rf) such that 

lim W s (j?) = P{rf) a.s., 
(3(rj) i c(0+) as r/ f oo. 

Proof. The subadditivity argument used in Proposition[7|applies unchanged 

in) 

to Ws , giving 
increasing, and 



to Ws , giving the first limit. The function r\ — > f3{rf) is a priori non 



lim Mr]) >(3 = c(0+) 



by Proposition |SJ 
Given s < oo let 



7r s be a path attaining W s 

tt' s be a path attaining Ws for r\ = 2s d ~ 1 . 
Given e > we can choose s = s(e) < oo sufficiently large that 

P(tt' s ^tt s )<e 

(because ns = 2s d will likely exceed the Poisson(s d ) number of points in 
[0, s] d ) and 

P(w{tt s ) > c(0+) + e) < e 
by Propositions [7| and |5J So 

P(w(ir' s ) > c(0+) +e) < 2e. 

Applying Lemma El to 7^ and = c(0+) + e gives the upper bound in 



,^u- w , c(0+) + e + sd^2e 

sd 1 / 2 



0(2s d ~ l ) < lim sup < c(0+) + e + 



and the lower bound follows from the construction in Lemma 1121 Taking 
e — ► 0, so that s = s(e) — ► oo, 

lim /5(r?) < c(0+). 
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3.5 Translation invariant distributions in the infinite model 

Here we prove part (d) of Theorem^by relating Approach 4 to Approach 3. 
Fix < 5 < 1 and recall denotes a Poisson point process of rate 1 per 
unit volume in Recall from section Td. II the definition of L(s,S). Write 
£(s,5) and V(s,S) for the edge-set and the vertex-set of a cycle attaining 
length L(s,5). For a cube C, vertex-set V and edge-set £ , write |V PI <C| for 
the number of vertices of V in C, and write len(£ PI C) for the total length 
of edges of £ restricted to C. 

Write U s for a uniform random position in [0, s] d , and write £^ = £i — U s 
for the positions of the Poisson points relative to the "random origin" U s ; 
then write £(s, 8) for the corresponding set of relative positions of edges of 
the tour attaining L(s,S): 

£{s,S) ={(g\$)): ^i,^)e£(s,6)}. 

Now the pair (^(^),£(s,5)j takes values in the space S* of point-sets and 
paths defined as the space S in section 12 except that for S* we allow cycles 
in addition to doubly- infinite paths. 

There is a natural metric topology on S* obtained by regarding it as 

~(s) 

a space of marked point processes. For each s the point process (Q ) is 
exactly a Poisson process. By letting s — > oo through some subsequence we 
can define a limit 

($%£(s,5)) 4 m),£(5)) onS* 

where £{5) is an edge set on some subset V{8) of vertices of the Poisson 
point process It is easy to check that the edge-set forms doubly-infinite 
paths (rather than finite cycles) and so the right side has some distribution 
on S. From the uniform distribution for U s it is easy to check that \i is 
translation invariant. We will show (in the notation of section |2J) 

6(im) = 6; £(h)<c(5) 

which immediately implies c(S) < c(6). With a little extra effort we could 
prove l(pi) = c(S), but this doesn't help. 

From the fact that V(s, 6) contains exactly [<5iV(s)] of the N(s) Poisson 
points in [0, s] d , it is clear that for fixed r > 

lim E\V(s,S) n (U, + Cr)\ = 5r d 

s— >oo 
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and therefore 

E\V(S) nC r | = Sr d , 

the interchange of limits being justified by the fact that the total number of 
Poisson points in U s + C r has fixed distribution. This tells us that 5(/i) = 5. 
Now fix b < oo and let £ b (S) and £ b (s, 5) be the subsets of £{5) and £(s, 5) 
obtained by taking only edges of length < b. By translation invariance 

Elen(£ b (5) nC r ) = 5G{b)r d 

for some G(b), and then 

/•oo 

1(H) = 5 / bdG{b). 
Jo 

By weak convergence, letting s — > oo through a subsequence, 

Elen(£ b (5) n CO = lim£len(5 ft (s,$) n (E7, + Ci)) 
= lima -d .Elen(£ 6 (a,5)). 

Since <5c(#) = lim s s~ d Elen(£(s, 5)), we can let b — > oo and use Fatou's 
lemma to conclude t{n) < c(8). 

For the converse, let \x be a translation invariant distribution attaining 
c(S). That is, /x specifies an edge-set £ on a vertex-subset V C such that 

(i) E\VnC r \ = 5r d Vr>0; 

(ii) £^ len(£ n C r ) = 5c{5)r d Vr > 0. 

Now consider large s and small r] > 0. The intersection of £ and C s consists 
of a set of paths, each of which enters C s at some point on some face, 
and exits at some point on some face. Consider the subset of paths which 
intersect [rjs, (1 — rj)s] d , and write £ s ' v for the edges in this subset, truncating 
an edge which crosses a face at the crossing point. We extend this edge-set 
into a cycle in C s as follows. Suppose the number of crossing points on 
each face is some even number (otherwise add superfluous edges, making no 
asymptotic difference); let D be the total number of crossing points. On 
each face create a tour (in the face) of the crossing points; by Lemma [2J a) 
this has length at most AisD^ -2 ^^ 1 ^. Within each such tour replace 
alternate edges by double edges. The collection of these within-face edges, 
and the paths through the interior of C s , form a connected graph where 
each vertex has even degree, so we can find a Eulerian cycle; write T s ^ for 
the edges in this cycle. So 

EleniF^) < Sc(S)s d + (Ad)A x s {ED)^ 1 ^ . (20) 
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Because each crossing point is associated with some path-segment of length 
> rjs from the face of C s to the face of [rjs, (1 — rj)s] d , we have 



Drjs < len(£ n 1 



and so by (ii) 



ED < (r ]S y 1 5c(5)s d . 
Substituting into (|2fl|) gives 

limsups-^len^ 5 ''') < 5c(6). 



(21) 



Now the cycle J- S,r> goes through all the vertices V D [rjs, (1 — r])s] d . By (i) 



E 



V n [r]s, (1 - r])s] d = 5(1 - 2 V ) d s d 



and by comparison with the total number of Poisson points in C s we get a 
crude bound 



var 



V n [r]s, (1 - r])s] d < s d (s d + 1) < 2s M [s > 1) 



,,2d 



Now consider k > 2. Using geometric subadditivity (Lemma Hfc)) and 
independent copies of T s,ri on each of the k d subcubes, we can construct a 
cycle g ks,v in [0, ks] d such that extends to 



limsup(ks)- d Elen(g ks ' r >) < Sc(S) 



Chebyshev's inequality implies 

P( number of vertices in Q ks * < S(ks) d ({l - 2r]) d - rj) 



(22) 



< 



2k a s 



dJld 



(5k d s d r]Y 



25- 2 V - 2 k- d . 



Taking k = k(s) — > oo sufficiently slowly, (|22[) remains true, so by definition 
of c(-) and the uniform boundedness property 

5{(1 - 2rf) d - 2t/) c(*((l - 2r]) d - 2rj)) < 5c(5). 

Letting ij — > and using continuity of c(-) shows c(<5) < c(<5). 
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3.6 The limit for T m 



Recall the definition (Approach 1) of T m . If we could prove a sublinear 
growth property, that the optimal path stays within a ball of radius o(m), 
then part (e) of Theorem ^ would follow easily from the other parts. Be- 
cause we cannot prove this property we use a more circuitous method which 
eventually (Lemma 114(1 compares a general path from the origin with either 
a cycle through the origin (Lemma I13() or a path between diagonals of a 
cube (Xemma IT2*|) . 

Recall Lemma EI] ° n linear diagonal percolation. We need a stronger 
result. Given a realization of the Poisson point process in [0,s] d , first remove 
some fis points, then look for the optimal diagonal path through at most ns 
of the remaining points. We show that the same constant (3{rj) appears in 
the limit. 

Lemma 11 Given the points of the Poisson process in [0, s] d , for any r\ > 0, 
fjL > let 

:= max min 

A 7r:7rnA=0 m(lT) 

where the maximum is taken over all subsets A of points with size at most 
fj,s and the minimum over all paths tt € IIo s through at most rjs points that 
exclude A. Then 

lim = 0(rj) a.s. 

s—>oo 

Proof. We consider the two-dimensional case. The same proof works for 
dimension d > 3 without significant changes. 

Consider the square [0, k 2 s] 2 , partitioned into subsquares of side s. Con- 
sider the 2k + 1 subsquares (Si, 1 < i < 2k + 1) whose lower left corners 
are (k, 0), (k, 1), (k — 1, 1), (k — 1, 2), ... , (1, A;), (0, k). For each i consider the 
"staircase" consisting oik 2 — k diagonally-adjacent subsquares starting with 
Si. Note that all these subsquares are distinct. See Figure 2. 

For each such staircase consider a joint path ni , obtained by concate- 
nation of k 2 — k paths each attaining Ws^ in its own subcube. 

Given n > and e > 0, using Lemma |H1 and Lemma ^] we can choose 
large s so that 

lim sup w(-7r^) < f3(rj) + e, 

k 

thus with probability — > 1 as k — > oo we can choose at least 3k/ 2 staircases 
i satisfying 

m(4 i} ) < Vk 2 s, w{tt^) < /3(rj) + 2e. (23) 
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Figure 2: 2k + 1 diagonal paths in a large square 

For any subset of points A of size at most \ik 2 there are at least k staircases 
containing no more than \ik points of A each. Thus with probability 1, for 
all sufficiently large k and all such A we can find some 7r? satisfying (|23|) 
and 

(i) 

ir k contains < \ik points of A. 

(i) 

From such a path ir k we create a path fr^ G Yl 0k 2 s by first deleting the 
points in A and then adding two edges from the path-ends to the corners 
to [0, k 2 s] d . These changes have asymptotically negligible effect on average 
edge-length, so we have shown 

lim < 0(ri) + 2e a.s. 

fc^oo K s 

and the result follows easily. 

Next we will use Lemma^Jto show that when one weakens conditions on 
Wg^ by allowing paths through points outside the cube [0, s] d , this doesn't 
affect the limit (3{r]). 

Lemma 12 (unbounded linear percolation) Let Ws be the minimal 
average step length over all paths from (0, ...0) to (s, ...s) through some points 
of a rate 1 Poisson point process in M. d , with at most ns steps. Then 

limW^ = (3(r]) in probability. 

s 

Proof. Fix rj and consider the possibility that, for some (5' < (3{rf) 

lim sup P{W^ < 13') > 0. (24) 

s 
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It is enough to show this cannot happen. 

~ (v) 

Let n s be the optimal path for W s ■ There exists a constant p such that 
7Tg is contained inside [— ps,s + ps] d a.s. for large s. In the cubes [— ps,0] d 
and [s, s + ps) d take two diagonal paths tt*, 7r;? with no more than r\p steps 
each, that attain minimal step length while avoiding the points of tt s . By 
Lemma ^2 the average edge- length in nl and in 7r^ is asymptotically f3{rf). 
Then the concatenated path {irl, ir s , ir^} satisfies the constraints in Lemma 
EI] for a diagonal path across and within the large cube [— ps, (p + l)s] with 
at most rj(l + 2p)s points. On the event in (|24|) this path has average step 
length less than some /3" < 0(rj). So (|2*1)) contradicts the conclusion of 
Lemma 1101 

Lemma 13 (free cycle) Let L m (0) be the minimum length over all cycles 
on m points of the Poisson process that pass through the origin. Then 

lim L m (0)/m > c(0+) in probability. 

m— >oo 

Remarks. Such a cycle contains two edges from the origin to points of the 
Poisson process. Our results ultimately imply the inequality is really an 
equality, but we won't need to prove that now. 
Proof. Assume, to get a contradiction, that for some c < c(0+) 

limsupP(L m (0)/m < c) > 0. (25) 

m— >oo 

Let p > be a constant such that the cycle ir m attaining length L m (0) is 
contained inside a box [— pm, pm] d a.s. for large m. 

Fix -q > 0. By Lemma ITT1 it's possible to choose two paths 7r^ and 7r^ 
across the diagonals of [— pm, 0] and [0,pm] d respectively in such a way 
that 7r^, 7r^j have no common points with 7r m except the origin, and 

n\ < r)pm, l\ = j3(rj)n\ + o(m) 

n 2 < npm, l 2 = (3(n)n2 + o(m) 

where m, ri2 and l\, I2 denote the number of edges and the length of ir^ 
and 7T^. 

One can adjust a few edges around the origin to get a concatenated path 
{tt^j, 7T m , 7r^} with length L' < l\ + I2 + L m (0) and with n' = n\ + n 2 + m — 2 
steps, across the diagonal of the whole cube [— pm, pm]. Observe 

n 1 +n 2 + m < + J_ 
2pm ~ 2p 
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On the event {L m (0)/m < c} in <j2~5|) we have 

h + h + L m (0) o(m) + L m (0) -P(rj)m 

P(V> S 

n i + n 2 + m n\ + ri2 + m 

o(m) + (c — (3(n))m 



< 



n\ + ?i2 + m 



< ^M + (i)< c - c(0+) +0 (i) 

- 1 + 2np w - 1 + 2np K ' 

the final two inequalities because c — /3(r/) < c — c(0+) < 0. Because of (l2*H|) 
we can apply Lemma E3 and deduce 

But this implies lim^oo /3(r/) = — oo, which is impossible. 
We can finally prove part (e) of Theorem^ 

Lemma 14 (free path) Let T m be the minimal length over all paths through 
m points of the Poisson process starting at the origin. Then 

\\mT m /m = c(0+) in probability. 

m 

Proof. The upper bound 

limP^^Tfc < c(0+) + e) = 1 Ve > 

k 

was given at (0). For the lower bound suppose, for some c < c(0+) 

lim sup P(T m /m < c) > 0. (27) 

m 

Let 7r m be a path attaining length T m and let x m be the position of its 
end-point. Consider 

r* = sup{r : limsupP(|x m [ > rm,T m < cm) > 0}. 

m 

Clearly r* < oo. Suppose first = 0. Then we can construct a free cycle 
from 7r m by adding an edge from x m to the origin, and such cycles have 
average length < c + o(l), contradicting Lemma 

Alternatively suppose > 0. Then using rotational invariance of the 
Poisson process, for arbitrary e > 

limsupP(x m G [(r — s)m, (r + e)m] rf and T m < cm) > 0. 



26 



Adding an edge from x m to ((r+e)m, . . . , (r+e)m) gives a path between the 
diagonal corners of [0, (r + s)m] d through m points with average edge length 
at most c + 2ed 1 / 2 , which contradicts Lemma ED when e is small (because 
fi{rf) > c(0+)). So (|2Tj) is false and we have proved T m /m — > c(0+) in 
probability. 

4 Final Remarks 

4.1 Subadditivity and cost-reward problems 

The technique in section l3~4l seems applicable in many contexts where sub- 
additivity is used. For instance, to modify the context of first-passage per- 
colation on the lattice, suppose that for each edge e there is a "cost" c(e) 
and a "reward" r(e), so that for each path ir = (e\.e2, • • • , e n ) there is a cost 
and a reward (c(tt), r(-7r)) = c(ej), r ( e «))- Then one can study by the 
same technique 

^4 n := minc(7r)/r(7r) 

7T 

minimized over paths across the diagonal of [0, n] d . Developing a general 
theorem which establishes limits n~ 1 A n — > ao a.s. in such settings would be 
a natural research project. 
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